HMM-based visual speech synthesis using dynamic visemes

نویسندگان

Ausdang Thangthai

Barry-John Theobald

چکیده

In this paper we incorporate dynamic visemes into hidden Markov model (HMM)-based visual speech synthesis. Dynamic visemes represent intuitive visual gestures identified automatically by clustering purely visual speech parameters. They have the advantage of spanning multiple phones and so they capture the effects of visual coarticulation explicitly within the unit. The previous application of dynamic visemes to synthesis used a sample-based approach, where cluster centroids were concatenated to form parameter trajectories corresponding to novel visual speech. In this paper we generalize the use of these units to create more flexible and dynamic animation using a HMM-based synthesis framework. We show using objective and subjective testing that a HMM synthesizer trained using dynamic visemes can generate better visual speech than HMM synthesizers trained using either phone or traditional viseme units.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition

This paper presents the development of a novel visual speech recognition (VSR) system based on a new representation that extends the standard viseme concept (that is referred in this paper to as Visual Speech Unit (VSU) and Hidden Markov Models (HMM). The visemes have been regarded as the smallest visual speech elements in the visual domain and they have been widely applied to model the visual ...

متن کامل

Title Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models( Published Version ) Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models

The performance of automatic speech recognition (ASR) system can be significantly enhanced with additional information from visual speech elements such as the movement of lips, tongue, and teeth, especially under noisy environment. In this paper, a novel approach for recognition of visual speech elements is presented. The approach makes use of adaptive boosting (AdaBoost) and hidden Markov mode...

متن کامل

A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition

This paper presents the development of a novel visual speech recognition (VSR) system based on a new representation that extends the standard viseme concept (that is referred in this paper to as Visual Speech Unit (VSU)) and Hidden Markov Models (HMM). The visemes have been regarded as the smallest visual speech elements in the visual domain and they have been widely applied to model the visual...

متن کامل

Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs

This paper examines methods to improve visual speech synthesis from a text input using a deep neural network (DNN). Two representations of the input text are considered, namely into phoneme sequences or dynamic viseme sequences. From these sequences, contextual features are extracted that include information at varying linguistic levels, from frame level down to the utterance level. These are e...

متن کامل

Automatic Selection of Visemes for Image-Based Visual Speech Synthesis

An image-based approach provides an eficient way for visual speech synthesis. In an image-based visual speech synthesis system, a few lip images, namely visemes, are used for generating an arbitrary new sentence. Many approaches select visemes manually. In this papel; we propose a method for a system to automatically select visemes by minimizing the synthesis error The feasibility of the propos...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2015

HMM-based visual speech synthesis using dynamic visemes

نویسندگان

چکیده

منابع مشابه

A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition

Title Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models( Published Version ) Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models

A Novel Visual Speech Representation and HMM Classification for Visual Speech Recognition

Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs

Automatic Selection of Visemes for Image-Based Visual Speech Synthesis

عنوان ژورنال:

اشتراک گذاری